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ABSTRACT 

Let  X-|,...,Xn  be  iid  F and  let  Y ,...,Yn  be  independent  (and 
independent  also  of  X^,...,Xn)  random  variables.  For  i = l,...,n,  set 
6.  = 1 if  X.  < Y.  and  =0  if  X..  > Y..,  and  = min{X^,YJ.  Then 

assuming  that  F is  distributed  according  to  a Dirichlet  process  (Ferguson, 
(1973)  Ann.  Stat.  2,  209-230)  with  parameter  a,  the  authors  (1976),  J.  Amer. 

A 

Stat.  Assoc.,  61,  December)  obtained  the  Bayes  estimator  F^  of  F under 
the  loss  function  L(F,F)  = /(F(u)-F(u))2dw(u)  using  (6-j ,Z^ ),..., (6n»Zn) . 
Now  let  X-j , . . . ,Xn  be  iid  FQ  and  , . . . , Y^  be  iid  G where  both  FQ 
and  G are  unknown  continuous  distributions.  In  this  paper,  it  is  shown 


~ _ I 

that  Ffi  is  mean  square  consistent  with  rate  0 ( n ) and  almost  sure 
consistent  with  rate  0(log  n//iT).  Also,  it  is  established  that 

A 

{F^(u)|0  < u <_  T),  T < ®,  converges  weakly  to  a Gaussian  process  whose  co- 
variance  structure  coincides  with  the  limiting  covariance  structure  obtained 
from  Kaplan-Meier' s ((1958),  J.Amer.  Stat.  Assoc.,  53,  457-481)  product 
limit  estimator  by  Breslow  and  Crowley  ((1974),  Ann.  Stat.  2,  437-453). 
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LARGE  SAMPLE  THEORY  FOR  A BAYESIAN  NONPARAME'f’RIC 
SURVIVAL  CURVE  ESTIMATOR  BASED  ON  CENSORED  SAMPLES 

V.  Susarla^'^'^  and  J.  Van  Ryzin^'^'^ 

1.  INTRODUCTION  AND  SUMMARY 

Recently,  attention  has  been  drawn  to  the  consideration  of  obtaining 
nonparametric  Bayes  estimates  of  a distribution  function  assuming  a manageable 
prior  (resulting  in  a manageable  posterior  distribution)  on  the  space  of  dis- 
tribution functions  F on  R = (-co,co).  Towards  this  goal,  Ferguson  [4] 
introduced  a class  of  priors,  known  as  Dirichlet  process  priors,  on  F which 
enjoy  the  property  that  the  posterior  distribution  is  again  a Dirichlet  process. 
Ferguson  used  this  fact  to  obtain  the  Bayes  estimator  of  the  right  sided 
cumulative  distribution  function  F (F(x)  denotes  the  probability  in  (x,°°) 
and  this  useful  convention  is  borrowed  from  Efron  [3])  under  a weighted 
squared  error  loss  function.  It  can  be  readily  seen  that  this  Bayes  estimator 
of  F will  have  all  the  asymptotic  properties  enjoyed  by  the  maximum  likeli- 
hood estimator  of  F if  there  is  no  prior  on  F and  the  observations  are 
i.i.d.  with  an  unknown  right  c.d.f.  Fq. 

An  important  problem  in  survival  analysis  is  that  of  estimating 

either  parametrically  or  nonparametrically  the  survival  curve  P(X  > x)  = F(x). 

(See,  for  example.  Gross  and  Clark  [6]).  While  treating  this  problem  of 

estimating  survival  curves  based  on  incomplete  data,  the  authors  [11]  obtained 

the  Bayes  estimator  of  F under  a weighted  squared  error  loss  function  when 

the  independent  observations  from  F are  randomly  censored  on  the  right 

under  Dirichlet  process  priors  of  Ferguson  [4].  They  demonstrated  that  this 

Bayes  estimator  is  an  extension  of  the  above  mentioned  Bayes  estimator  of 

Ferguson  [4]  and  in  a certain  sense,  also  of  the  well-known  Kaplan-Meier 

(KM)  estimator  [7]  which  maximizes  the  likelihood  of  the  observations. 

Efron  [3]  and  in  a more  detailed  manner,  Breslow  and  Crowley  [1]  showed 

that  the  KM  estimator  is  weakly  consistent  and  asymptotically  normal 
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under  the  assumption  that  all  the  censoring  random  variables  are  i.i.d. 
continuous  random  variables. 

The  object  of  this  paper  is  to  show  that  our  Bayes  estimator  has  good 
limiting  properties  including  mean-square  consistency  (m.s.c.),  almost  sure 
consistency  (a.s.c.)  and  asymptotic  normality  assuming  that  the  observations 
are  i.i.d.  with  right  c.d.f.  FQ  and  that  the  censoring  random  variables 
are  i.i.d.  with  a continuous  distribution  function.  Efron  [3]  and  Breslow 
and  Crowley  [1]  have  neither  rate  of  convergence  results  for  their  weak 
consistency  nor  do  they  have  m.s.c.,  while  we  obtain  rates  for  both  m.s.c. 
and  a.s.c.  Our  methods  of  proof,  in  contrast  with  those  of  Breslow  and 
Crowley  [1],  involve  the  analysis  of  the  expectation  and  the  variance  of  the 
logarithm  of  Wn(u)  involved  in  the  Bayes  estimator  given  in  (2.2). 

This  paper,  therefore,  generates  a new  class  of  estimators  of  Fq, 
one  for  each  parameter  of  the  Dirichlet  process,  when  the  observations  are 
censored  on  the  right.  Furthermore,  we  have  shown  that  each  member  of  the 
class  has  better  asymptotic  properties  than  those  established  for  the  KM 
estimator  by  Breslow  and  Crowley  [1].  For  results  of  this  type  in  para- 
metric cases,  see  the  bibliographies  of  Lindley  [8]  and  Shapiro  [9].  A 
justification  for  this  paper  is  given  on  page  615  of  Ferguson  [5].  In  his 
words,  "Bayes  rules  are  certainly  desirable  since  generally  they  are 
admissible  and  have  nice  large  sample  properties.  Therefore,  it  behooves 
the  statistician  to  suggest  large  classes  of  easily  computable  Bayes  rules 
in  the  hope  that  users  may  find  some  rules  to  their  liking,"  especially  if 
the  Bayes  rules  have  the  usual  large  sample  properties  enjoyed  by  the 
maximum  likelihood  estimator. 

The  next  section  formally  describes  the  precise  statement  of  the 
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problem  and  needed  assumptions.  The  rest  of  the  sections  deal  with  various 
asymptotic  aspects  of  our  Bayes  estimator. 

2.  DESCRIPTION  OF  THE  PROBLEM  AND  SOME  NOTATION 

Let  X-|,...,Xn  be  a random  sample  from  a right  sided  c.d.f.  F 
with  F(0)  = 1 and  Yj,...,Y  be  another  random  sample  such  that 
(X1 X ) and  (Y^,...,Y  ) are  mutually  independent  of  each  other.  Set 

(2.1)  6.=[X.  <Yi]  and  Z.  = min{Xi , Y. } 

for  i = l,.,.,n  and  assume  that  1 - F be  distributed  according  to 
Dirichlet  process  with  parameter  measure  a on  the  Borel  a-field  B in 


(0,°°).  Then  the  Bayes  estimator  of  F,  under  the  loss  function. 


L(F.F)  = / (F(u)-F(u) )2dw(u) , is  shown  in  [11]  to  be 

0 


(2.2)  F (u)  = -!L(.U.)  5 

a _ /rv+\  . _ 


a(Z")  + N (Z.)  + X. 


[6.  = 0,  Z.  < u] 


a(R+)  + n i=l  | a(ZT)  + N+(Zi) 


= Bn(u)  Wn(u) 


where  N+(t)  = # of  observations  > t and  X.  = # of  observations  at  Z^ , 
i - 1 ,. . . ,n. 

The  main  results  of  this  paper  concern  the  asymptotic  behavior  of 
F^  under  the  following  assumptions: 

(Al)  X-j,...,Xn  are  i.i.d.  with  right  sided  c.d.f.  Fg,  a fixed 
unknown  distribution  on  (0,°°) 

(A2)  Y.|,...,Yn  are  i.i.d,  with  right  c.d.f.  G,  a fixed  unknown 

continuous  distribution  on  (0,°°). 


-3- 


Thus,  while  the  rule  under  consideration  is  a Bayes  rule,  the  asymptotic 


properties  of  are  obtained  in  a non-decision  theoretic  setup.  In  other 

A 

words,  we  obtain  the  asymptotic  behavior  of  as  an  estimator  of  Fq. 

Throughout  we  assume  that  for  a fixed  (but  otherwise  arbitrary)  u, 

(A3)  a(u)  > 0. 

Since  it  is  not  possible  to  estimate  Fq(u)  whenever  G(u)  = 0,  we  assume 

throughout  that  G(u)  > 0 without  further  reference. 

In  the  hope  of  reducing  repetition,  a right  sided  c.d.f.  is  referred 
to  as  simply  a distribution  and  the  assumptions  stated  above  are  assumed 
throughout  the  paper. 

3.  MEAN  SQUARE  CONSISTENCY  WITH  RATES 

This  property  of  F can  be  studied  through  the  corresponding  one 
for  the  logarithm  of  wn(u)  of  (2.2).  Under  (A3),  one  obtains  that 

n fa(ZT)  + N+(Z . ) + 

(3.1)  In  W(u)  = X [6,  = 0,  Z.  < u]  In  J * 

" 1=1  1 1 “ |_a(ZT)  + N+(Z.) 

where  a(s')  = limit  of  a(t)  as  t t s.  Observe  that  (3.1)  (and  hence 

(2.2) )  is  well-defined  since  a(u’)  > 0 by  (A3).  This  property  is  not  enjoyed 
by  the  KM  estimate  which  is  not  always  well-defined  in  the  right  tail.  It 

is  precisely  this  property  of  converting  (2.1)  into  a sum  by  use  of  logarithms 
that  allows  us  to  obtain  stronger  convergence  results  than  do  Breslow  and 
Crowley  [1]  for  the  KM  estimator. 

For  dealing  with  the  expectation  and  the  variance  (and  properties 
based  on  these)  of  In  W , the  following  decomposition  which  follows  by  a 
logarithmic  expansion  of  the  summands  in  (3.1)  and  the  succeeding  lemmas 
will  be  extremely  useful: 
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(3.2) 


ln  Wn(u)  = Rn,l(u)  + Rn,2(u)  + Rn,3(u) 


where 

(3.3) 


nR  ,(u)  = J [6.  = 0,  Z.  < u]H_1(Z.), 

M » 1 j - 1 J J J 


(3.4) 

and 

(3.5) 

where 


R (u)  = l [<5.  = 0,  Z.  < u]  l r1(a(Z:)+HN+(Z.))’\ 
j = i J J ~ SL=2  3 3 


nR  (u)  = 1 [6  = 0,  Z.  < u]{n[(a(Z:)+l+N+(Z.)]'1-H'1(Zi)} 

j - 1 J J J J J 


(3.6)  H = FQG  . 

Lemma  3.1.  E[Rr  ^(u)]=  In  G(u)  and  nH2(u)  var(Rn  ^ (u ) ) £ 1. 

Proof.  Since  the  sumnands  of  R .(u)  are  identically  distributed, 

n 9 1 

E[RnJ(u)]  = E[[61  = 0,  Z1  <_  u]H“1(Z1)]  = - / F0(t)H_1  (t)dG(t)  = -In  G(u) 

by  the  definition  of  H in  (3.6).  The  variance  result  follows  since  each 

of  the  iid  summands  in  R ,(u)  is  bounded  by  H ^(u). 

n > i 

Lenina  3.2.  a2(u”)H‘*(u)(n^3)E[R2  2 (u ) ] £ n2(l+a(u~))2. 

n 

Proof.  Since  (a,+...+ a_)2  < n a2  for  any  real  numbers  a,,...,a„, 

in  ■*”  j i in 

and  since  for  fixed  n,  the  summands  of  R 7(u)  are  identically  distributed, 
n‘2E[R2,2(u)]  < E[[61  = 0,Z1  < u](  [ Jl"1 (a(Z“ )+l+N+(Z1 ) )"£)2] 

(3.7) 

<{  l (a(u")+1 )3”^}  E[(a(u")+l+N+(u) ) 4] 

~ 1=2 


-S- 


where  the  second  inequality  follows  by  bounding  the  series  by 

oo 

{ l (u(u  )+l)  Ha(u  )+l+N  ( u ) } and  by  dropping  the  indicator  function. 

1=2 

The  result  now  follows  from  the  following  inequality  and  (3.7) 

E[(a(u-)+l+N+(u))'4]  = (n"1)(a(u")+l+k)_4Hk(u)(l-H(u))n_1'k 

k=0  < 

1 H“4(u)V(^)Hk+4(u)(l-H(u))n+3-(k+4)/(nt3) 

k=0  4 

where  the  inequality  follows  since  k+i  _<  i(a(u")+k+l)  for  i = 1,2,3,  and  4. 

Lemma  3.3, 

a)  nH2(u)|E[Rn>3(u)]|  < a(R+)  + (l -H(u)  )n(n+a(R+)  )+2^ot(R+)  (l-H(u)1/2(n+l  )_1 

+ 2a(R+) (2+a(R+) ){n(n+l )}-1 

b)  (n+2)H2(u)E[R2j3(u)]  < 2{1 -H(u)+2(n+2)_1 (a2(R+)+(l -H(u) )2)}. 

Proof.  Since  the  summands  of  R Ju)  are  identically  distributed, 

li  ) J 

E[Rn,3(u)]  = EtCS^O.Z^uKntCZ*  ) +1+N+(Z1 ) )-H_1  (Z] )}] 

(3.8)  u 

= - /Fn(t)E[n(a(t")+l+K.)'1-H"1(t)]dG(t) 

0 1 

where  K^.  is  a binomial  random  variable  with  parameters  n-1  and  H(t). 

Now  observe  that  after  some  simplification, 

H(t)E[n(ot(t")+l+K.  )-1-H-1  ( t ) ] = "l  ("71 ) (k+1  )_1{nH(t)-a(f)-k-l } 

L k=0  K 

(3.9)  x Hk(t)(l-H(t))n_k_1+n^  (n"1)a(t"){k+l+a(t')-nH(t)} 

k=0  K 

x Hk(t)(l-H(t))n"1_k  {(k+1 ) (k+l+a(t") )}-1  = I + II. 

By  a rearrangement. 


-6- 


while 


-nH(t)I 


u(t')+(l-H(t) )n(nH(t)+a(t') ) 


11  i^x^nH^2)|k+2'(n+1)H(t)+H(t)+a(t")-1)|Hk(t)(1-H{t))n+1“(k+2) 

k ~ 0 

since  k+2  £ 2(a(t  )+k+l).  By  a change  of  variable  (k+2  = £)  and  by  using 
the  binomial  moments,  we  can  show  from  the  above  inequality  that 

n < 2a(t")  [{H(t)(l-H(t)l1/2  H(t)+a(t~)+l  i 

- H (t)  /Ff(n+1)  «»*'>'  J ■ 

This  bound  on  II  together  with  that  on  I,  (3.8)  and  (3.9)  give  the  first 

result  since  t £ u in  all  the  calculations  after  (3,8). 

n n 

Using  the  inequality  ( l a.)2  < n la2,  and  the  fact  that  the  summands 

i=l  1 ~ i=l  1 

of  R o(u)  are  identically  distributed  for  fixed  n,  it  can  be  shown  that 
n j j 

u _ -I  -j 

E[R2  3(^)1  £ - / Fg(t)E[(n(o<(t“)+Kt.+l )"  -H~  (t))2]dG(t)  where  Kt  is  as 

in  the  proof  of  a).  From  here  on,  the  proof  runs  parallel  to  that  in  a). 

Two  consequences  of  the  decomposition  (3.2)  and  the  above  three 

lernnas  are  given  below,  the  first  of  which  conserns  the  mean  square  consistency 

_1  * 

(m.s.c.)  of  In  W as  an  estimator  of  In  G while  the  second  one  concerns 
n 

A 

the  m.s.c.  of  Fa  of  (2.2)  as  an  estimator  of  FQ. 

Theorem  3.1.  Let  FQ(u)  > 0.  nE[|ln  Wn(u)-ln  G"^(u)|2]  is  bounded. 

Proof.  The  proof  is  a direct  consequence  of  the  decomposition  (3.2) 
and  the  above  three  lemmas. 

A 

Theorem  3.2,  Let  FQ(u)  > 0.  Then  nE[(Fa(u)-F0(u) )2]  is  bounded, 

where  the  bound  is  given  in  the  remark  below. 
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Proof.  Recalling  that  B and  W are  defined  in  (2.2),  we  obtain  by 
3 n n 

a ^-inequality, 

(3.10)  2'1(Fa(ii)-F0(u))2  < G‘2(u)(Bn(u)-H(o))2+B2(u)(Wn(u)-G'1(u))2 

where  we  used  the  equality  H = F^G.  Since  nBn(u)  can  be  approximated  by 
the  binomial  random  variable  N+(u)  whose  expectation  is  nH(u ) (=nF^(u )G(u) ) , 
we  obtain  that 

(3.11)  (a(R+)+n)2E[(Bn(u)-H(u))2]  = (a(u )-H(u )a(R+) ) 2+nH(u ) ( 1 -H(u) ) . 

In  W (u)  ) c f ) l 

After  writing  e n and  e n 2 u for  Wn(u)  and  G (u), 

respectively,  and  then  using  the  mean  value  theorem  leads  to  (Wn(u)-G  ^(u))' 

<_  1 1 n W (u)-ln  G”^  (u) ' ? (Wn(u)+G~^  (u ) )2  since  wp(u)  and  G ^(u)  _>  1.  Hence, 

(3.12)  E[B2(u)(Wn(u)-G-1(u))2]  < (1+G”1  (u))2E[  ) In  Wn(u)-ln  G'^u))2] 

since  Bn  and  BnWn  are  <_  1.  Using  the  fact  that  E[  | A+B+C | 2 ] <_  3(E[A2] 

+ E[B2]  + E[C2])  and  then  using  Lemmas  3.1,  3.2,  and  3.3  give  the  result 
in  view  of  (3.10)  - (3.12). 

Remark.  The  bound  in  Theorem  3.2  is  given  by 

Cn(u)  = 2(a(R+)+n)  2{n2H(u ) ( 1 -H ( u ) )+n(a(u) -a(R+)H(u) ) 2) 

+ 6(l+G"1(u))2H‘2(u){l+a'2(u‘)H"2(u)(nJ3)"1n(l4a(u“))2 
+ the  rhs  of  b)  in  Lemma  3.3  + the  square  of  the  rhs  of 
a)  in  Lemma  3.3). 

4.  ALMOST  SURE  CONSISTENCY 

A A 

Looking  at  the  estimator  F^  of  (2.2),  it  is  obvious  that  F^ 
converges  a.s.  provided  Wn(u)  does  likewise.  The  following  lenrna  concerning 
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the  almost  sure  behavior  of  R , , R „,  and  R 7 involved  in  In  W (u) 

n ) i n ) c n ^ j n 

of  (3.2)  is  essential  for  the  main  result  of  this  section. 

Lemma  4,1 . Let  F(J(u)  > 0.  Then 

a)  jRp  (u)  + In  G(u)  J = 0(--°9-  ) a.s. 

’ vn 

b)  |R  ?(u)  | = 0(^-^)  a.s. 

n,<:  /n 

c)  !Rn  ,(u)|  = Ot^3-1)  a.s. 

n’J  /x\ 

Proof.  a)  follows  from  the  first  part  of  Lemma  3.1  and  the  law  of  iterated 

logarithm  for  iid  random  variables,  b)  follows  from  Lemma  3.2  and  the 

-1  -2 

G1 i venko-Cantell i theorem  since  l n (log  n)  < «. 

n=2 

To  prove  c),  we  observe  that  jRn  ^(u)|  is  exceeded  by 


-i  1 

{n  l [6.  = 0,  Z,  <_  u]} 
j = l J J 


r I n 

sup  (| r 

■ t < u a(t  )+l+N  (t) 


hTET 


l> 


Since  the  expression  in  the  first  curly  brackets  is  bounded  by  unity,  it  is 
enough  to  show  that 


(4.1) 


sup  { (- 


o “t'i  u".(tW(t)  ‘ 

The  lhs  of  (4.1)  is  bounded  by 


■|)  = 0{l9a_!l)  a.s. 


v'n 


(4.2) 


a(u")+l+N+(u)  0 < t < u 


5UP  H(t,  - tafiLil 


Now  by  Lemma  1 of  Dvoretzky,  Kiefer,  and  Wolfowitz  [2] 


0 < t < u 
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for  some  absolute  constants  Cq  and  c.  Hence 

sup  11^-  H(t)|)  . O(^)  a.s. 

0 < t £ u /n 

oo 

_ p 

since  c(log  n)2  £ 2 log  n for  large  n and  l n < °°.  This  completes 

n=l 

the  proof  of  (4.1)  and  hence  also  of  c). 

Theorem  4.1.  Let  Fq(u)  > 0.  Then 

A 

F ^ ( u ) - FQ(u)  = 0(log  n//n)  a.s. 

Proof.  By  a triangle  inequality, 

(4.3)  |Fa(u)-FQ(u)|  < G‘1(u)|Bn(u)-H(u)|+Bn(u)|Wn(u)-G'1(u)|. 

The  first  term  on  the  rhs  is  0(log  log  n//n)  a.s.  since  ) (N+(u)/n)-H(u) | 

= 0(log  log  n//n)  a.s.  by  the  law  of  iterated  logarithm. 

As  in  the  proof  of  Theorem  3.2,  we  can  show  that 

(4.4)  Bn(u)|Wn(u)-G_1(u)|  £ (1+G'1(u))|ln  Wn(u)-ln  G_1(u)|. 

But  'In  W (u)-ln  G”\u)|  = 0(log  n//n)  a.s.  due  to  the  decomposition  (3.2), 
a triangle  inequality,  and  parts  a),  b),  and  c)  of  Lemma  4.1.  Thus  (4.3) 
and  (4.4)  complete  the  proof. 

5.  WEAK  CONVERGENCE  OF  F 

a. 

In  this  section,  we  consider  the  weak  convergence  of  {Fa(u)|0  < u £ T} 
where  T < «.  We  assume  throughout  this  section  that  H(T)  = Fq(T)G(T)  > 0. 

It  is  convenient  sometimes  to  suppress  the  dependence  of  the  functions  and 
to  let  ||  ||y  denote  the  sup  norm  over  (0,T]. 

The  discussion  to  follow  reduces  the  consideration  of 

A 

{ F ^ ( u ) 1 0 < u £ T)  to  a much  more  accessible  form.  To  start  with,  we  observe  that 
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(5  1) 
and  that 


where  c 
(5.2) 


Wa  - Fq)  = 

In  W 

Wn-G'(  = e n-e 

= G_1(ln  W 
is  between  0 and 

IW„-F0>-’®r'<Bn 


/n(B  -H)G_1  + /n(W  -G_1)B 
' n ' n ' n 


in  G 


-1 


In  G 


-1  In  W -In  G’ 


(e  n 


-1) 


, (In  W -In  G'1)2  „ r-l 

i_  n-1  \ . n ' _c  + In  G 

n ~ n G > + 2 e 

in  Wn  - In  G”\  Therefore,  from  (5.1),  we 


-H)  - /n  HG_1(1n  W -In  G'1 
v n 


1 ||T  || In  Wn-ln  G-'  ||T  IjG"1  llT  * f ( IP n M^-ln  G"'  ||T)’. 

The  purpose  of  the  following  lemma  is  to  show  that  the  rhs  of  (5.2)  -*•  0 

8 - 1 

a.s.  by  showing  that  np  ||ln  W -In  G"  ||j  ■+  0 a.s.  for  any  2B  < 1 . 


Lemma  5.1.  n,J  (jin  Wn  - In  G”^  ||j  ->  0 a.s.  for  any  26  < 1 . 

u 

Proof.  With  H (u)  = / FQ( t)d(l -G( t) ) = P[6]  = 0,  Z]  < u]  and  with 
~ n G 

nH  ( u ) = l [6.  = 0,  Z.  < u],  we  have 
n j=l  J J ~ 

II 1 n W -In  G"1  ||T  = ||  } n ln{l-(a(s")+l+H  (s)  )_1  }dH(s)-  /H_1(s)dH(s)  || 
n 1 0 0 

(5.3) 

||  / nfa(s“)  + l+nH  (s))_1dH  (s)-  / H"1 (s)dH(s)  ||T 
0 n n 0 

+ n (l+a(T~) )2 

(a(T")+l+nHn(T))2  (a(T~))2 

where  the  inequality  follows  by  a logarthmic  expansion  and  an  obvious  weakening 
of  the  series  from  the  second  term  onwards.  Observe  that  the  second  term  in 
the  rhs  of  (5.3)  ■+  0 a.s.  at  a rate  n~®  with  26  < 1. 
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For  the  first  term  on  the  rhs  of  (5.3), 

||  /n(ot(s“)+l+nH  ( s ) }_1  dH  (s)  - f H_1(s)dH(s)  ||T 
0 n n 0 1 

(5.4)  < ||/{n{a(s')+HnH(s)r1-H'1(s)}dHn(s)  ||-  + ||/H_1  (s)d(H  -H)(s)  |L 

0 n 1 0 


ll|/[n{a(s')+l+nHn(s)}'1-H'1(s)]dHn(s)  ||T  + 2H_1  (T)  ||Hn-H  ||T 

where  the  second  inequality  follows  by  applying  integration  by  parts  to 

/ H (s)d(H  -H)(s)  and  upon  observing  that  the  variation  of  H on  (0,*] 

0 n 

is  H \').  By  the  law  of  iterated  logarithm  for  iid  random  variables 
|| n f ot(  • ~)+l+nHn ( • ) }-H~^  ( • ) |L  -*•  0 a.s.  at  a rate  0(n  ,J)  with  23  < 1 

and  by  applying  the  argument  given  by  Singh  [10]  to  the  random  variables 
[6j  = 0,  Zj  <_  u],  j = l,...,n,  we  obtain  that  ||Hp-H  ||j  ■*  0 a.s.  at  a rate 
-8 

0(n  ) with  2B  < 1 . Consequently  the  rhs  of  (5.4)  and  hence,  the  rhs  of 
(5.3)  -*•  0 a.s.  at  a rate  0(n”®)  with  2B  < 1. 

In  view  of  (5.2),  an  easy  Corollary  to  the  above  lemma  is 

Corollary  5.1.  ||  /n  ( F^- Fq  ) - /nG  ^ (Bp-H)-v,nHG  ^(ln  Wp-ln  G ^)  ||j  0 a.s. 

at  a rate  0(n"S)  for  any  23  < 1 . 

By  following  the  method  of  proof  of  Lenina  5.1,  we  can  also  show  that 


Lemma  5.2.  ||/n(ln  W -In  G~^)-/n(/  H~^dH„  - /H-1dH)  ||T  ->  0 a.s.  at  a rate 

n 0 0 

0(n"^)  with  2B  < 1 . 
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Note.  The  random  integral  / H ^dfl  could  be  infinity,  but  finite  a.s. 

Q n n 

since  P[H(i(s)  - 9]  = P[«5.  = 0,  Zj  <_  s for  j = l,...,n]  <_  P[Zj  £ T for 

j = 1 n]  = ( 1 -H(T ) )n  for  all  s < T. 

Hence,  by  Corollary  5.1  and  Lemma  5.2,  we  can  study  the  weak  conver- 
gence of  {F  (u)jO  < u <_  T}  through  the  corresponding  one  for 

(5.5)  ,'nG'1(Bn-H)  + /nHG_1(  /H'1  dHn  - /H_1dH)  . 

The  following  theorem,  which  is  parallel  to  Theorem  3 of  Breslow  and 
Crowley  [1],  is  needed  in  the  study  of  the  weak  convergence  of  (5.5)  or 

A 

equivalently  that  of  f F ^ ( u ) 1 0 < u < T). 

Theorem  5.1  . Define  (P  ,Q  ) £ D(0,T)  * D(0,T]  (D(0,T]  is  the  space  of 
functions  on  (0,T]  with  jump  discontinuities)by  P n = /n(H-Hn)  and 
Qn  = /n(Hn-H).  Then  (P^.Q^)  converges  weakly  to  a bivariate  Gaussian 
process  (P,Q)  which  has  mean  0 and  a covariance  structure  given  for 


s < t by 


(5.6) 


Cov(P(s),P(t))  = H(t) (1 -H(s) ) , 

Cov(Q(s) ,0(t) ) = H(s) ( 1 -H( t) ) , 

Cov(P(s) ,Q(t) ) = H(s)-H(t)(l-H(s)), 

and  Cov(Q(s),P(t))  = H(s)H(t) 


where  H = FrG  and  H ~ / F_d ( 1 -G) . 
u o u 

As  in  (7.9)  of  Breslow  and  Crowley  [1],  we  can  represent  (5.5)  as 
(5.7)  (5.5)  ■ Wfc'X-H)  ♦ ♦ B„  ♦ R,>n  ♦ R2j„ 


where 


-1  i- 


f P H_2dH  , 

r\  n 


(5.8) 


Bn  * V''  -/OnH'2dO-H)  , 

R,  = n"1/2  f P2H‘2H"1dH  , 
l,n  q n n n 


and  R. 


I PnH-’H-’d(5n.5)(u) 


By  the  above  representation  for  (5.5)  and  steps  similar  to  Theorem  4 of 
Breslow  and  Crowley  [1]  , we  obtain  the  following  theorem  which  is  similar  to 
Theorem  5 of  Breslow  and  Crowley  [1], 

Theorem  5.2.  Let  T < °°  and  H(T)  > 0.  Let  Fq  and  G be  continuous. 

A 

Then  the  random  function  /n^F^-F Q ) on  (0,T]  converges  weakly  to  a mean 

0 Gaussian  process  R*  = -G~V  + / H 2PdH  + H~^Q  + f H~2QdH  with  covariance 

0 0 

structure  given  for  s £ t by 

-1  s -I 

Cov(R*(s) ,R*(t) ) = F0(s)FQ(t){H  1 (s) (1-H(s))+  / H 'g  ’dG) 

(5.9)  $ 0 

= FQ( s ) FQ ( t ) { / H~ 1 Fq1 d(l— Fq)}  . 

Remark  5.1  . The  covariance  calculations  involved  in  (5.9)  are  given  in 
the  Appendix.  We  notice  here  that  the  rhs  of  (5.9)  coincides  with  (7.13) 


of  Breslow  and  Crowley  [1], 


CONCLUDING  REMARKS 


There  are  three  small  sample  advantages  for  the  Bayes  estimator  (2.2) 
over  the  KM  estimator.  The  first  is  that  it  is  defined  everywhere  on  the 
real  line  for  any  n while  KM  estimator  is  not.  Secondly,  as  illustrated 
in  our  paper  [11],  the  Bayes  estimator  is  smoother  than  the  KM  estimator. 
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The  final  important  advantage  is  that  the  Bayes  estimator  is  an  admissible 

estimator  of  F provided  that  the  support  of  a = support  of  w = (0,°^) 

under  the  loss  function  L(F,F)  = / (F(u)-F(u))2dw(u)  and  under  the  weak 

0 

convergence  topology. 

The  results  of  this  paper  can  be  extended  to  the  case  in  which 
Y^,...,Yn  are  independent,  but  not  identically  distributed.  The  technique 

used  to  obtain  this  extension  is  different  from  the  ones  proposed  here  and 
will  appear  elsewhere. 
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APPENDIX:  COVARIANCE  STRUCTURE  OF  R OF  THEOREM  5.2 

★ 

To  study  the  covariance  structure  of  R , it  is  convenient  to  study 
the  covariance  structure  of 


R = F'V  = H-1P  + / H‘2PdH  + H_1Q  + /H'2QdH. 

0 0 

We  use  repeatedly  (5.6),  integration  by  parts  and  the  equalities  H = FqG 

and  dH  = d(  /Fnd(l-G))  = -FndG.  We  write  Cov(R(s),R(t)  = Var(R(s)) 

0 u 

+ Cov(R(s),R(t)-R(s))  for  0 < s <_  t < T,  show  that  Var(R(s))  = the 
expression  in  the  curly  brackets  of  (5.9),  and  that  Cov(R(s) ,R(t)-R(s))  = 0. 

VARIANCE  OF  R(s) 


(1) 


Var(H'1(s)P(s))  = H_1 (s)(l-H(s)) 


(2)  2Cov(H~1(s)P(s),H_1(s)Q(s)) 


-2H_1(s)H(s)  , 


(3)  2Cov(H_1(s)P(s),  /H~2PdH) 

0 


(4)  2Cov(H-1(s)P(s),  /QH_2dH) 

0 


2H"1 (s)  / 


dH 


Cov(P(u),P(s))  d~ 
H2(u) 


= -2  / ^ - 2 In  G(s) 


2 

HliT 


0 H 

s 

r 

0 


f Cov(P(s)_,Q(u) ) dH 
H2  (u ) 


= + 


2 Sjf} + 2 in  g(s)  * 


s u 


(5)  Var(/H_tPdH)  = 2 / / gov(p(.ul»pLr)I  dH(r)dH(u) 
0 00  H2(u)H2(r) 


= 2 / dH(r)  = 2/  ^1n  G^s^~1n -G^dG-ln2G( s 
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(6)  Var  (P-  (s)Q(s) ) 


fru“2nju  u~l/-\n/~\\  — ^ r 


(8)  2Cov(/H^PdH,H"'(s)Q(s))  = rrrry  / 

0 1 ^ 0 H2(u) 


s ,p  U 


|r  i / dH  - H(s)  / dH  + H(s)  / i dH 


(9)  2Cov(H'1(s)Q(s),  / H'2QdH)  = fov(Qlu-L 

0 [S>  0 H (u) 


s ~ s 

(10)  2Cov(/H'2PdH,  /H'2QdH) 


H2  (s) 


s u 

= 2 ; / 
0 0 


H2 (u)H2 (r) 


, 2 ) “ HlOH(u) 

0 0 H2 (u)H2  (r) 


^ dH(r)dH(u)  + 2 / / Cov(Plr.)-iQ(uJ 
0 0 H2 (u)H2(r) 

(u)dH(r)  + 2 / / ^r)-^u)0-Hlr.)j 
0 0 H2(u)H2(r) 


hHT0;  hg 


dG  + 2)  + 2H|||  ln  G(s)  . 2 / — 


0 H 


H(s 


0 


HG 


0 H : 


+ 2/  — dH  - 2 / 


_H 
0 H2 
s u 


(In  G(s)-1n  G(r)) 
HG 


dH  + 2 / 1n-  G-(s)~ln— G.LrJ  dG 


2/  — dH  + ln2G(s) . 


0 H2 

Adding  (1)  through  (10)  and  using  the  facts  that  H = F^G  and 

dH  = -FgdG  wherever  necessary,  we  obtain  the  expression  in  the  curly 
brackets  of  (5.9). 


dH 


COVARIANCE  OF  R(s)  AND  R(t)-R(s) 


There  are  16  terms  in  this  covariance  calculation  which  are  grouped 
below  into  5 sets  of  expressions.  The  sum  of  all  the  expressions  in  each 
group  will  be  equal  to  zero,  thus  showing  that  Cov(R(s) ,R(t)-R(s) ) = 0 
for  0 < s £ t < T. 

(*')  Cov(^.  fill}  - ™ - ° 

(A2)  Cov(£j|j  ./H"2PdH)  = H_1(s)/  g.0VA.p(.9-l»pCli)  dH  = G(s)-ln  G(t)) 

H(s)  s s H2  (u) 

(-)  Cov(^-g[f|)  =^{^-l}-HXtl^ 

(A4)  Cov(jN-,/H"2QdH)  = n7~r  / ^v(P^)-^-u-)-)-  dH  = H_1  (s )/  -H(S)'H(U) C 3 ~.H.Ls ) ,) 
H(s)  s H's'  s H2 (u)  s H2 (u) 


dH 


. H(s 


h ■ tttW)  - Wfi} - m 
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(Bl)  Cov(/H'?PdH,  - HTii} 


0 


Hit! 


Cov(P(u)  ,P(t) ) d~,  x . 1 / Cov(P(u),P(sJJ.  dH(u) 

H(s)  n H2 (u) 


H2  ( u ) 


; L^M  dH(u)  - ; dH(u)  = o . 

0 H2 (u)  0 H2 (u ) 


(B2)  cov(S[|j,  n{ry  - = 


Cov(Q(s),P(s))  _ q 
H?(s) 


fn-x\  rn  (fu'2nHu  P(t)  PM)  = 1 / Cov(P(u),q(.U)  dH(u) 

(B3)  Cov(/H  QdH,  HTsT  H ( tj Q H2(u) 


_ 1 } CoyM^flsR  dH(u) 

H(s)  o h2 (u) 


o 


(Cl)  Cov(/H-2PdH,/H-2PdH)  = f f dH(u)dH(r) 


0 


0 0 H2(u)H2(r) 


■ ji 


H-HjrU  d;(r)  / I dH(u ) * / (In  G(s)-ln  G(t))dH(r) 

...  ' ' H(U  ) r>  u2  t „\ 


0 H2(r) 


0 H2 (r) 


= (In  G(s)-ln  G(t) ) / dH(r)  . 


0 H2 ( r) 


s _?  oft)  0 ( s ) x 1 Sr  Cov(0(t)  ,P(u))  1 sr  Cov(Q(s)  ,P(u)I  HH(lJ) 

(C2)  Cov(/H  2PdH,  “ otly)  “ H^Tq  ' h*(u)  ^0  H2(u) 


HltT 


/ 


H(u)-H(t)(l-H(uU  d~(u)  . 1 ) H(u)-H(s)(1-H(u)1  d~(u) 

h(s)q  H2  (u ) 


0 H2 (u) 

f i dH  - ^ / M 

0 H2  H(t)  0 H2 


m 


_H 

0 H2 


H , 1-H 


dH  - rrrrv  / — dH  + n-  / ■ dH 


0 H‘ 
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(C3) 


0 


s t 


= f ) dH(r)dH(u) 

0 s H2 (u)H2 (r) 


, 1 1 i M + { 

= HTsT  ” HTtTo  H2 


(D„  Cov(B-’(s),(s),  ■ H-'(s)  ) £HlailUiu]ld'H(u, 

= y|||  (In  G(s)  - In  G(t))  . 


HU)  . H(i)  + m G(t)-ln  G(s)}/  ~ dH 
Htt)  HTiT  OH2 


(D2)  Cov(H‘1(s)Q(s),^ 


(t)  0(s! 

K COV< 

Sis) 

TTT  ‘ 

p " HI 

tsWt 

Cov(Q(s),Q(s)) 
H2  (s) 


Hfs)fl-H(t))  _ H(sK1-H(s))  < 
= ~H(s)H(t)  H2(s) 


n/.x  t ? 1 * CovCO(s)  .Q(u))  = H(s)  f VH  dH 

(»3)  Cov(2||>,  /H  zm)  - grry  ! H’ 


H ( s ) r 1 _ +ln  G(t:)-ln  G(s)}. 

= FitsT  hTsT  Hity+HitT  hitt 


(El ) Cov(/H"^QdH,/H”^PdH)  ■ / ) 


0 s H2 (r)H2 (u) 

= ) ) dH(r)dH(u) 

0 s H2 (r)H2 (u ) 

= -{-In  G(t)+ln  G(s)H§-[§|  + ln  G(s)l. 
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(E2) 


Cov(/H^QdH, 

0 


s 

; 

o 


Cov(Q(t),Q(u))  dH(u)  - 
H2  (u) 


T.  / Cov(Q(s)>q(uj,)  dH(u) 
huIq  h2  (u) 


1 -H(t)  S HdH  _ 1-H(s|  J _H  dH  > 
q h2  H 0 H2 


(E3) 


Cov(/H  2QdH/H_2QdH) 

0 s 


; ; Cov(Q(r),Q(uU  dH(u)dH(r) 
0 s H2 (u)H2 (r) 


J \J 


0 s H2(u) 


- r 


lhht 


i 

HTtT 


s 

} f 
0 


dH  + 


s 

(/ 

0 


•7  - $!}  * 'n  G(tH" G(s”' 
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Let  Xj,...,Xn  be  lid  F and  let  Y ,...,Y^  be  independent  (and 
independent  also  of  X^,...,X  ) random  variables.  For  i = l,...,n,  set 
cS.  = 1 if  X.  < Y.  and  =0  if  X.  >Yr  and  Z.  = min{Xi  ,Y . } . Then 


assuming  that  F is  distribuLod  according  to  a Dirichlet  process  (Ferguson, 
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Abstract  ( 2) 


(1973)  Ann.  Stat.  2,  209-230)  with  parameter  u,  the  authors  (1976),  J.  Amer. 

A 

Stat.  Assoc . , 61,  December)  obtained  the  Bayes  estimator  of  F under 

a a 

the  loss  function  L(F,F)  = /(F(u)-F(u))?dw(u)  using  (6^  ,Z^ ),...,  (6^  ,7_n ) . 

Now  let  X, ,...,X  be  iid  F*.  and  Y,  ,...,Y  be  iid  G where  both  Fn 

I n 0 I n 0 

and  G are  unknown  continuous  distributions.  In  this  paper,  it  is  shown 
that  F^  is  mean  square  consistent  with  rate  0(n  ) and  almost  sure 

consistent  with  rate  0(log  n//n).  Also,  it  is  established  that 

A 

{F  ( u ) 1 0 < u < T),  T < »,  converges  weakly  to  a Gaussian  process  whose  co- 

Cl  ( 

variance  structure  coincides  with  the  limiting  covariance  structure  obtained 
from  Kaplan-Meier ' s ((1958),  J.Amer.  Stat.  Assoc.,  53,  457-481)  product 
limit  estimator  by  Breslow  and  Crowley  ((1974),  Ann.  Stat.  2,  437-453). 


